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Abstract. We study a special case of the vertex splitting model which is a re- 
cent model of randomly growing trees. For any finite maximum vertex degree D, 
we find a one parameter model, with parameter a € [0,1] which has a so-called 
Markov branching property. When D — oo we find a two parameter model with an 
additional parameter 7 S [0, 1] which also has this feature. In the case D — 3, the 
model bears resemblance to Ford's a-model of phylogenetic trees and when D — 00 
it is similar to its generalization, the a7-model. For a — 0, the model reduces to 
the well known model of preferential attachment. 

In the case a > 0, we prove convergence of the finite volume probability 
measures, generated by the growth rules, to a measure on infinite trees which is 
concentrated on the set of trees with a single spine. We show that the annealed 
Hausdorff dimension with respect to the infinite volume measure is 1/a. When 
7 = the model reduces to a model of growing caterpillar graphs in which case 
we prove that the Hausdorff dimension is almost surely l/a and that the spectral 
dimension is almost surely 2/(1 + a). We comment briefiy on the distribution of 
vertex degrees and correlations between degrees of neighbouring vertices. 
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1. Introduction 



Random trees are an important tool in many branches of science, 
ranging from quantum gravity models \13\ [25] to biological applications [H 
[TT] . to name a few. In this paper we introduce and study a new model of 
randomly growing, rooted, planar trees which we refer to as the attachment 
and grafting model, or ag-model for short. It is a special case of the vertex 
splitting model, recently introduced in [9]. The vertex splitting model is 
a modification of a model of growing trees, encountered in the theory of 
random RNA folding [llj. 

The ag-model is described informally below and a more detailed de- 
scription is given in Section [2l The root of the tree is simply a marked 
vertex of degree one and the planarity condition means that edges are or- 
dered around vertices. The parameters of the model are a, 7 G [0, 1] and 
D € {3, 4, . . .} U {00} denotes the maximum degree of vertices in the trees. 
When D < 00, a and D are the only active parameters of the model but 
when D = 00, 7 also plays a role. Define 



The growth rules can be explained as follows. Call the edges which are 
adjacent to vertices of degree one (besides the root) leaves and call the 
other edges internal edges. In each discrete time step a new edge is added 
by randomly selecting 

(a) a vertex of degree k > 2 with relative probability Wk and attaching 
a new edge to it (the k possibilities of attaching chosen uniformly at 
random) or 

(b) an inner edge with relative probability a and dividing it into two edges 
by grafting a vertex to it or 

(c) a leaf with relative probability 1 — a and dividing it into two edges by 
grafting a vertex to it, 

see Fig. [T] (left). It is from these two operations 'attachment' and 'grafting' 
which the model gets its name. When a = there is no grafting and the 
model reduces to the model of preferential attachment (see e.g. [21[TU]) with 
linear attachment kernel Wk- When a = 1 there is no attaching and we 
simply have a growing linear graph. 

The ag-model closely resembles the a7-model which was introduced 
in [8]. In the a7-model a new leaf is added in each time step by randomly 
selecting 

(a) a vertex of degree A; > 3 with relative probability {k — 2)a — 7 and 
attaching a new edge to it or 

(b) an inner edge with relative probability 7 and grafting a leaf to it or 




if -D < 00 
if D = 00 



(1.1) 



and 




for k<D 
for k> D. 



(1.2) 
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Figure 1. Growth rules of the ag-model (left) and the a7-model 
(right). The root is indicated by a cirlced vertex. 



(c) a leaf with relative probability 1 — a and grafting a leaf to it 

where < 7 < a < 1, see Fig. [1] (right). In the case 7 = 0, this model 
reduces to Ford's a-model of growing binary trees [17| in which case it 
resembles the ag-model with D = 3. When 7 < a it is similar to the 
ag-model with D = 00. 

The a7-model and the ag-model both have a property referred to 
as Markov branching which was introduced by Aldous in [2]. This means, 
crudely, that the subtrees below a given vertex have the same distribution as 
the whole tree, see Section [3l This feature makes the models much simpler 
to treat, since one can easily write recursion equations for many observables. 
Furthermore, recent results by Haas and Miermont provide a recipe for 
taking the scaling limit of such models |18] . 

The main results of this paper are the following. For q > 0, as the 
size of the trees goes to infinity, the measure concentrates on the set of trees 
with exactly one non-backtracking path from the root to infinity, referred 
to as an infinite spine. The emergence of a unique infinite spine is known 
in other models of random trees, an example being the uniform planar tree 
and modifications of it, see e.g. [14J. Similar effects are also observed in 
triangulation models in quantum gravity, where exactly one large "universe" 
appears with finite baby-universes attached, see e.g. [23]. We also establish 
that the average volume of a graph ball of radius R in the infinite trees 
grows like B}/". The exponent is referred to as the Hausdorff dimension 
and denoted by dfj- This power law behaviour is interesting since it is often 
the case that models of growing trees exhibit an exponential volume growth. 
This is e.g. the case in the preferential attachment model ^ which in fact 
corresponds to the case q = as was noted before. Furthermore, since 
a G [0,1], the full range of exponents, dn, is realized and the ag-model is 
one of few known natural tree models having this feature. 

1.1. Relation to the vertex splitting model. We now briefly introduce 
the vertex splitting model and show how the ag-model can be seen as a 
special case. The parameters of the vertex splitting model are given by a 
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set of non-negative weights Wij, with 1 < i, j,i + j — 2 < D, where D > 2 
(or D = oo) is a fixed number which denotes the maximum vertex degree 
in the trees. These weights are referred to as partitioning weights and the 
so-cahed splitting weights are defined by 

i ''^^ 

Wi = - '^ 1 <i < D. (1.3) 

i=i 

Starting from a fixed finite planar tree, in each discrete time step a new edge 
is added as foUows. 

(a) Select a given vertex v of degree i with relative probability Wi. 

(b) Randomly partition the edges which contain v into two disjoint sets 
of adjacent edges: V of size k — 1 and V of size i — {k — 1), with 
probability Wk,i+2-k/wi- For a given k, all such partitionings are taken 
to be equally likely. 

(c) Move all edges in V' from w to a new vertex v' and join v and v' by a 
new edge. 

We allow a small generalization of the above growth rule: single out a vertex 
of degree one in the initial tree and call it the root and modify (a) in such 
a way that the root is selected with relative probability tt^r > 0. Each time 
the root is split we define the new vertex of degree one to be the root. The 
vertex splitting model has very general growth rules and it includes many 
other models of random trees as special cases or limiting cases, see [HI [29] 
for more detailed discussion. 

The ag-model can be recovered from the vertex splitting model by 
assigning the weight Wr = ct/2 to splitting the root and choosing the nonzero 
partitioning weights as follows 

W2,2 = a, 

(X 

Wk,2 = 3< k< D, 

Wk+1,1 = ^, 2<k<D. (1.4) 

The splitting weights are then 

Wf^= (^'^ + ajk + l-2a-a, 2<k<D (1.5) 

and Wk = k > D. Note that in the case D = oo and 7 < a/2, the weight 
wi is negative. We will however include this case in the ag-model since the 
total weight of any transition is still positive. 

A similar relationship between the a7-model and the vertex splitting 
model was discussed in [9j. However, in that case one needs to take w^^i = 00 
which means that comparison of results in the two models is not necessarily 
reliable. The ag-model is therefore more interesting as a special case and 
due to its simplicity, yet non-triviality, it serves as a good testing ground 
for non-rigorous results obtained in the vertex splitting model. 
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1.2. Outline. The paper is organized as follows. In Section [2] we define 
rooted planar trees and introduce a convenient notation for representing 
random trees. Thereafter we give a proper definition of the ag-model which 
was described informally above. In Section [3] we show that the model has 
the Markov branching property and we calculate its first split distribution. 
In Section [4] we show, using methods from [28], that the finite volume prob- 
ability measures generated by the random growth operation, converge to a 
measure on the set of infinite trees. Furthermore, we characterize the infinite 
volume measure. In Section [5] we calculate the annealed Hausdorff dimen- 
sion with respect to the infinite volume measure and in a certain special 
case, we calculate the almost sure Hausdorff and spectral dimensions. The 
results we obtain, support certain scaling assumptions which were made in 
the vertex splitting model. We conclude by commenting on the distribu- 
tion of the degrees of vertices in the trees and correlations between degrees 
of neighbouring vertices by recalling results from [9j. In order to improve 
readability, proofs of theorems and lemmas are in most cases collected in 
Appendix B. 



In this section we begin by defining the set of rooted, planar trees 
and endow it with a metric. Then we define a convenient notation for 
representing random trees and introduce the model which will be studied in 
the paper. 

Start with a tree graph r which has vertices of finite or countably 
infinite degree and at least one vertex of degree one. By convention we 
define the root r of r to be a vertex of degree one and we label the unique 
nearest neighbour of the root by (1). The rest of the vertices are labeled 
in the following recursive way. The children of a given vertex in the tree 
(apart from r) with label {£) are labeled with sequences {£, 1), (£, 2), . . ., see 
Fig. [2j A rooted planar tree is a tree r along with such a lexicographical 
labeling. From here on, we will always work with rooted, planar trees unless 
otherwise stated and will simply refer to them as trees. We denote the set 
of trees with n edges by Tn and the set of all trees, finite and infinite, by T. 

A tree tq is said to be a left subtree of r if it is a connected subtree 
of T which contains r and has the properties that if it contains a vertex 
with label (l, k) then it contains all vertices with labels {i, i) with i < k, see 
Fig. [2j Let Bii{t) be the graph ball of radius R centered on the root of r. 
We define the left hall of radius i?, Lfi{T), as the maximal left subtree of 
Br{t) with vertices of degree no greater than R, see Fig. [2l A metric d is 
defined on T by 



2. Random planar trees 
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(1,1,2.1) (1,1,2,2) (1,,3.2,1) (1.3,2,2) 




Figure 2. Left: An example of a rooted, planar tree r and a left 
subtree tq (boxed in gray). Right: The graph ball B3(t) and the 
left ball L3{t) (boxed in gray). The root is indicated by a circled 
vertex. 



The metric d was first introduced in [2Tj and we refer to this paper for some 
properties of the metric space {T,d). 

Define the root joining operation * in the following way. Given trees 
Ti, T2, . . . , Tfc, k > 1, let r = * Ti * T2 * • • • * Tfc be the tree obtained by (I) 
identifying the roots of ri, T2, . . . , and labeling them by (1), (II) replacing 
the first element '1' of each label in tj by '1, j', 1 < i < A;, and (III) 
connecting a new root r to the vertex (1). If A; > 1 we may omit the 
symbol, see Fig. [3l Note that in general 

n * T2 * • • • * Tfe / T^(i) * T^(2) * • • • * T^(fc) 

for a permutation a of {1, 2, ... , A;}. 




* Ti Ti * T2 * ■ ■ ■ * Tk 



Figure 3. The root joining operation. 



Let TTn be a probability distribution on Tn- We define a random tree 
Tn by the canonical probability generating function 

Tn=Yl ''n{r)T. (2.2) 
rer„ 

The above sum of trees and multiplication of trees by a scalar are formal and 
provide a convenient way of storing information on the probability measure 

2.1. The ag— model. Using the notation introduced above, we now define 
the ag-model which was described informally in the introduction. Let 



W{n) = n — a. 
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(2.3) 



We introduce a growth operation / in the fohowing recursive way. Let s be 
the single edge tree and define I{s) = * s. For a tree r = * n * • • • * rfc_i 
define the random tree 

1 f"^ 

= W{\t\) \ ^ ^^l^^l^ » n * • • • * I{Ti) *---*Tk-l 



+ a * r H — — n * • • • * rj_i * s * Ti* ■ ■ ■ * t^^i I 
i=i / 

(2.4) 

where |r| denotes the number of edges in r. The growth operation / is 
equivalent to the growth rule which was described informally in Fig. [1] (left) 
in the introduction. The ag-model is defined recursively as the random tree 
Pn which satisfies Pi = s and 

Pn = I{Pn~l). (2.5) 

We denote the probability measure on Tn , generated by this growth process 
by fn- 

3. Markov branching 

A sequence of random trees (r„)„>i is said to satisfy a Markov branch- 
ing property, or to be Markovian self-similar, if there exist functions Qkini, . . . , n^.i), 
k >2 such that for all n > 2 

oo 

Tn = '^ ^ qk{ni,...,nk-i) 9*Tnj^*---*Tn^_^. (3.1) 

k=2 niH |-nfe_i=n— 1 

The functions qkin-i, . . . ,nfc_i) are referred to as the first split distribution 
of {Tn)n>i- We use the convention that Qkin-i, . . . ,nk-i) = if any of the 
arguments ni, . . . , rik-i equals zero. 

Proposition 3.1. The random trees {Pn)n>i, defined by \2. 5|) . have the 
Markov branching property with a first split distribution which satisfies 92(1) = 

92(n) = :^(t^(n-l)g2(n-l) + a), (3.2) 
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gfc(ni, . . . ,nfc_i) = ^^^^ [ Wijii - l)gfc(ni, . . . ,ni - 1, . . . ,nfc-i) 
+ ^ _ -^ Oni,iqk-i[ni, . . . ,ni_i,ni+i, . . . ,nk-i) 



for 3 < k < D anc 



(3.3) 

qn+iini,. . . ,nD) = (3.4) 
7 



where ni + • • • + = n. 

Proof. We use induction on n. Pn clearly satisfies ()3.ip for n = 2. Assume 
it satisfies (13.11) for some n. Then 



n+l 



oo ^ / 

X] qk{nu...,nk-i)—-ia^*{9*Pn,*---*Pn^_,] 

A:=2niH |-n.fe_i=n— 1 V 



H ^ Z] Pni * • • • * -Praj_i * S * P^i * • • • * Prik-i 
i=l 

k-l \ 

+ ^*Pn,*---* I{Pn,) *■■■* Pn,^, 

i=l J 

^ ia + Win-l)q2in-l))9*Pn 



W{n) 

OD ^ k—l I 

k=3ni-i hnfe_i=n ^ ' j=l \ 

+ W{ni - l)qk{ni, . . . ,ni - 1,. . .,nk-i) * * • • • * Pn^.i- 



This shows that (13. ip also holds for n + 1 and we conclude that it holds for 
all n > 2. □ 

The recursions for the first split distribution in Proposition 13 . 1 1 can be 
solved with straightforward methods. We state the result in the following 
proposition which can easily be proved by induction. The method for finding 
the solution is described in Appendix A. 

Proposition 3.2. The first split distribution of the sequence {Pn)n>i is 
given by 

r(fc-2 + i^)r(i-a)r(n + i) ^^^ «r(n,-a) 

qk[ni,...,nk-i) ^ (i^) r(A:) aT{n+l-a) H T (1 - a) T (n, + 1) 

/ k-l \ 

rii \ 



1 — a — a + a 

^ n + 1 



(3.5) 

where ni + • • • + n^.i = n. 

We will repeatedly use the following standard, easily derived identities 
when we work with the above first split distribution [1] 

y , = 1 - (1 - (3.6) 

;^^r(l-c)r(n+l) ^ ^ ^ ^ 



and 



(3.7) 



4. Convergence of the finite volume measures 

In this section we show that the measures generated by the growth 
process converge weakly to a measure v on the set of infinite trees. By weak 
convergence we mean that for ah bounded functions / which are continuous 
in the topology generated by the metric d 

j f{T)dun — > j f{T)dv, as n — > oo. (4.1) 

We will call an infinite non-backtracking path from the root, a spine. 
Let T be a tree with exactly one spine and let u be a vertex on the spine 
{v 7^ r) with degree k. We call the k — 2 finite subtrees of r which are 
attached to the vertex v outgrowths from the spine. 

Theorem 4.1. Let a > 0. The measures Vn, viewed as probability measures 
on T , converge weakly, as n — > oo, to a probability measure v which is 
concentrated on the set of trees that have exactly one spine. The degrees of 
the vertices on the spine are independently distributed by 

aT (^)T (k-2 + ^) , , 

= \ ° \ \ ^-TT-^, k>2. 4.2 

n ) r(i^)r(fe-i + i) ' - 

The outgrowths from the spine are finite with probability one and outgrowths 
from different vertices are independently distributed. If a vertex v on the 
spine has degree k and ti, . . . ,tl are the outgrowths from v to the left of the 
spine (in that order) and r^+i, . . . , Tfc_2 are the outgrowths from v to the 
right of the spine (in that order), then their joint distribution is 

^_ r{k-l + l) 1 ar{\Ti\-a) 

i^kin, Tk-2) - ^ r(fe - 1) 1 + m l\ r (1 - a) r(|T,| + 1) ""i^^i 

(4.3) 

where m = |ti| + • • • + |Tfc_2| > k — 2 and k > 3. 

A proof to the above theorem is given on page 1211 in Appendix B. 

We point out that the distributions fik are independent of how many 
of the outgrowths are to the left or to the right of the spine. For an ordered 
sequence of /c — 2 outgrowths, there are k — 1 different ways to arrange them 
around the spine. 

Below, we comment on some special cases. When D < oo, (j){k) = 
for k > D as it should be. In the case D = 3 and a = 1/2 the trees are 
generic, i.e. Vn is a critical Galton-Watson process conditioned to have n 
edges. This follows from the fact that the first split distributions can, in 
this case, be written as 

I2{n) = ]:^^ and 93(^1, ra2) = j ^"^^"^ (4.4) 



with 

Zn can be interpreted as a finite volume partition function corresponding to 
brandling weights tui = t^s = 1/4 and 11)2 = 1/2, see e.g. [14j. Furthermore, 
this is the only special case in which we obtain generic trees. This can be 
seen from the fact that when D > 3, outgrowths from the same vertex on 
the spine are dependent. 

When D = 00 and 7 = 1, 

(l){k) = a{l - a)^-^ (4.6) 

and it falls off exponentially in k. When D = 00 and 7 < 1 we find that for 
large k 

aT (^) 

0(A;) = —J^i^fc^i-A (l + C)(A:-i)) (4.7) 



i.e. it falls off with a power law in k. From the last formula, we see that 
when a < 1 — 7, the expected value of the degree of a vertex on the spine 
is infinite. A simple and interesting special case arises when 7 = in which 
case the outgrowths from the spine are single leaves. Such graphs have been 
referred to as caterpillars in the literature. The degrees of the vertices on 
the spine are distributed independently by 

aT{k-l - a) 

= r(fc)r(i-a) (^-'^ 

and they have an infinite expected value for all values of a. These caterpillars 
are a special case of 'caterpillars at a phase transition' in the equilibrium 
statistical mechanical model studied in [23]. We will consider this special 
case in more detail in the next section. 

5. The Hausdorff dimension 

The Hausdorff dimension is a notion of dimension of graphs and is 
defined in terms of how the volume of the graph ball scales with its 
radius R. The Hausdorff dimension of a graph G is defined as 

^ MIMM (5,) 

R^oo log{R) 

provided that the limit exists. This definition is only interesting on an 
infinite graph. On the hyper-cubic lattice Z"^ it holds that dn = d but in 
general dn is not an integer. This dimension has been studied by physicists, 
especially in the quantum gravity literature, see e.g. [5] and should not be 
confused with the usual notion of Hausdorff dimension in a metric space, 
although there are some similarities. 

The Hausdorff dimension can be defined in different ways for random 
graphs. If the graphs are distributed by u then they might first of all have, 

10 



I'-almost surely, a HausdorfF dimension dn as defined above. Secondly, we 
define the annealed Hausdorff dimension as 

R^oo log(i?) 

where denotes expected value with respect to i/. 

There is another notion of dimensionality which applies when one 
considers a sequence of finite volume measures {i'n)n>o on a set of graphs. It 
is usually defined in terms of how the average value of some typical distance 
in the graph (the maximum distance between vertices, the mean distance of 
vertices from the root, etc.) scales in relation to the volume of the graph 
n as it grows. This dimension has also been referred to as the Hausdorff 
dimension in the physics literature but to avoid confusion we will refer to it 
here as the fractal dimension and denote it by d/. To give a more precise 
definition, we adopt the one from [9j which is as follows: Define the radius 
of a finite tree T by 

'RT = i dT{r,v)aT{v) (5.3) 

where V{T) is the vertex set of T, r is the root, dx is the graph metric and 
axiv) denotes the degree of v. The fractal dimension is defined as 

= ,i,„ ^^g^-. (5.4) 

n^oolog((i?T)^„) 

If I'n converge to a measure concentrated on infinite graphs, df has been 
observed to be equal to dn (or dn) in many situations, a simple example is 
the uniform tree and modifications of it, see e.g. ^14j. It is however straight- 
forward to find a counterexample where dn ¥^ df and it is not entirely clear 
which conditions guarantee equality. We will comment on this relation in 
the ag-model below. 

We will now calculate the annealed Hausdorff dimension of the trees 
distributed by from Theorem 14.11 

Theorem 5.1. Let max{0, a} < a < 1. The random trees, distributed by v 
described in Theorem \4-l\ have an annealed Hausdorff dimension 

dH = -. (5.5) 

a 

To prove Theorem 15. H we need to analyse the large R behaviour of 
{\Br\)u- In order to simplify the notation we let (0) be the empty tree and 
define ^2((0)) = 1- We then extend the probability distributions Hk, k > 2, 
to probability distributions on |Jfc^2 T'^~^ and define 

oo 

l^ = Y4>{k)fJ'k- (5.6) 

k=2 
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Since the outgrowths from different vertices on the spine are i.i.d. it is clearly 
sufficient to show that 

Y,\Bn{n)\) =i?i/"-i(l + o(l)) (5.7) 

as R — > oo. This follows from the Lemma below which is proved on page 
[Ml in Appendix B. 

Lemma 5.2. For max{0,a} < a < 1, 

ViB wa\ - (-« + ^)r(^)r(fi+^) , 
\.\Bdn)\)^ - r(i?)r(fl + ^) 

Note that dn = oo when D = oo and a < 1 — 7 (a < max{0, a}) 
since then the expected value of degrees of vertices on the spine is infinite. 
However, the z^-almost sure Hausdorff dimension might still be finite. We 
confirm this in the case D = 00 and 7 = 0, when the trees are caterpillars. 

Theorem 5.3. Let < a < 1 and 7 = 0. Then 

dH = - (5.8) 
a 

u -almost surely. 

The proof is given on page 1271 in Appendix B. 



5.1. Comparison to the fractal dimension. In the original paper on the 
vertex splitting model [9] it was shown that the expected value of the radius 
of a tree can be written as 

+ 1 " 

(^r).„ = \^ Yl (^^^2 + 1) qk{n - na, ns) (5.9) 

712=0 k 

where q'fc(rai,n2) is the probability that a uniformly chosen vertex v has de- 
gree k and that the volume of the subtree attached to v containing the root 
is ni and that the other subtrees attached to v have a total volume 712. Fur- 
thermore, in the case of linear splitting weights Wi = Ai + B, qkini,n2) was 
shown to be a solution of a system of linear recursion equations determined 
by the growth rules of the vertex splitting model, see [91 Section 3]. These 
recursion equations could not be solved explicitly but it was assumed that 
the following scaling holds 

qk{ni,n2) ~ (ni + n2)''^'^^ujk{ni/{ni + 7x2)) (5.10) 

for some A and "scaling functions" uj^. The linear recursions were thus 
reduced to an eigenvalue equation for A 



Cx = Ax 
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(5.11) 



where A is the Perron-Frobenius eigenvalue of the (^) x (^) matrix C 
indexed by a pair of two indices ki, 2 < k < D, 2 < i < k, and given by the 
matrix elements 

Cki,jn = —{Wkj+2-k{{j - 'i)Si,n + iSn,j-k+i) " WkSk,jSi,n)- (5-12) 

Comparing (j5.9p and (jS.lOp to (|5.4p allows one to find the fractal dimension 

df = X-^. (5.13) 

The scaling assumption ()5.10p was not proven but the results ()5. 11115. 13]) 
were supported by simulations in the case D = 3. 

It is interesting to compare df, corresponding to the weights (jl.4p of 
the ag-model, to the values of dn obtained in Theorem 15.11 It is straight- 
forward to solve (j5.1ip for small values of D and find that df = dn = l/a- 
Furthermore, we have calculated df in the case D = oo and 7 = 1 by solv- 
ing ()5.1ip numerically. We used a cutoff D = 30 on the system which is 
expected to closely approximate the case D = 00, since the vertex degree 
distribution is believed to fall of exponentially in this case, cf. (j6.4p . The 
results are shown in Fig. HI The agreement we find, supports the validity of 
the scaling assumption ()5.10p to a very high maximum degree D. 




0-1 ■ , ■ 1 ■ 1 ' 1 ■ 1 

0.2 0.4 0.6 0.8 1 

a 



Figure 4. Comparison of d/ (gray squares) and dn = 1/a (solid 
line) in the case D = 00 and 7=1. Using the weights (|1.4[) . we 
calculated df numerically from ()5.im5.13l) for a = i/ 100, I < i < 
100, using a cutoff I? = 30 on the system. 



5.2. The spectral dimension. We conclude this section by mentioning 
another notion of dimension of graphs called the spectral dimension. It is 
defined in terms of how the return probability of a random walker on the 
graph decays with time t. More precisely, for a tree r let Prit) be the 
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probability that a simple random walk which leaves the root at time t = 
is back at the root at time t. The spectral dimension of r is defined as 

= _ ^Y';^^)) (5.14, 

t->oo log(t) ^ ' 

provided the limit exist. The spectral dimension can take any value greater 
than one and does not necessarily agree with the Hausdorff dimension. We 
refer to [U [T^ \TE[ [22] for discussion of the spectral dimension of several 
types of random graphs. 

It would be interesting to calculate the spectral dimension of the trees 
distributed by v. For now we only have results in the case when the trees 
are caterpillars. 

Theorem 5.4. Let < q < 1 and 7 = 0. If dg exists then 

ds = (5.15) 
1 + a 

v~ almost surely 

The theorem is proved on page 1281 in Appendix B. 

6. Vertex degree distribution and correlations 

In this section we use results from the vertex splitting model [9] to 
calculate the vertex degree distribution and correlation between the degrees 
of neighbouring vertices in the ag~model. Not all results in this section are 
rigorous and we will comment on this point below. 

Let Xi^n be the number of vertices of degree i in a random tree with 
n edges and define the vertex degree densities 

Pi = hm (6.1) 

n— ^-oo fi 

It was shown in [9l Section 2] that the densities in the vertex splitting model 
satisfy the linear equation 



-p 



+ E (6-2) 



assuming that the splitting weights are linear wi = Ai + B and under certain 
technical conditions on the partitioning weights. The splitting weights are 
linear in the ag~model, cf. (|1.5p . and one can check that for small D the tech- 
nical conditions needed on the partitioning weights are fulfilled. However, it 
is not certain whether (j6.2p holds for general D and one would need further 
analysis to verify that. It is straightforward to solve (|6.2p for the weights 
(jl.4p and thus we find that the vertex degree densities in the ag-model are 
given by 

1 — a , , 

Pi = 6.3 

I — a — a 

14 



(1 - a)T («+2^) T(k-2 + ^) 
Pk = ^ ^ \ — , " % , , k>2 6.4 



and 



provided that (|6.2p holds. By sending a to zero we find that these results 
agree with results previously obtained in the preferential attachment model 
[21 [26]. Also note, that in the case D = oo, pk has in general a power law 
behaviour except when 7 = 1 (a = 0) in which case it falls of exponentially 
with rate (1— Q!)/(2— a). This resembles properties of the degree distribution 
of the vertices on the spine, cf. (j4.6p and (|4.7p . 

Let Xij^n be the number of edges with endpoints of degree i and j in 
a random tree with n edges, using the convention that the vertex of degree 
i is the one closer to the root. Define the density 

Pij = hm ^ — ^i^. (6.5) 

n— )-oo n 



It was shown in [9j that these densities in the vertex splitting model satisfy 

'-Pik 



W2 W2 .^T"-', W2 

t>j-l 

+ {k-i)Y: ^^^^^p.. (6.6) 

i>k-l 

assuming that the limit (j6.5p exists. The densities give us information about 
correlation between vertex degrees of neighbouring vertices. It can be mea- 
sured with a correlation coefficient 

Ej,kU - - ^)iPjk - PjPk) 
r = ^ (6.7) 

E,(j-im-(E,(j-i)p. 

where 

kpk , - Pij + Pji 

Pk = ^ . and Pij = \ ■ (6.8) 

Li I'Pk 2 

The coefficient r takes values between —1 and 1. If r < the graph is said 
to show disassortative mixing and vertices with high degree prefer to be 
neigbours of vertices with low degree. If r > the graphs are said to show 
assortative mixing and vertices with high degree prefer to be neighbours of 
vertices with high degree, see e.g. [27] . 

We will conlude this section by calculating r for two choices of param- 
eters in the ag-model, namely D = 3 and D = oo, 7 = 1. We consider (j6.6p 
with the weights given in (jl.4p . In the case D = 3, (j6.6p can be explicitly 
solved and we find that 

r = - , "('-'"^ {D = 3). (6.9) 

(5 — 3q)(3 — a) 

This can of course be repeated for small values of D. However, when D is 
large or infinite it is more difficult to solve (|6.6p explicitly. Instead, we study 
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the generating function 



Six,y)= ^ p,kx'-V~' (6.10) 

j,k>2 

and use the fact that 

d.dySil, 1) = - - ^)Pjk (6.11) 

to calculate r. Equation (j6.6p becomes a linear, first order partial differential 
equation in terms of the generating function S{x,y). It can in principle be 
solved for a general set of parameters, however we only comment on the case 
-D = oo, 7 = 1. In that case, the coefficients of the derivative terms in the 
PDE are zero and we get an ordinary equation for S{x,y). The solution is 

S{x, y) = + xf{y) + yg{x) + p22xy (6.12 

\ — ax — ay 



where 



/(y) = 5^m/-' and g{x) = Y,Pj2X^-\ (6.13) 
fe=3 i=3 

2(1 - Sa + a^) a ^ , ^ ^ 

= (l-a)(2-a)^ " +(13^^' '^2' 

2a^ - 15a^ + 38a^ - 37a + 8 + (2a^ - 6a + 4)j ^ 

~ (l-a)2(2-a)2 

2a2 _ 6a + 2 + aj . 
+ r, ^ V> J>3, 



(1-a; 



with 



7] = — and a = . (6.14) 

2 — a 3 — a 



From these expressions we find that 

r- = - ^, " , (i^ = oo,7 = l). (6.15) 
2(1 + a) 

We plot the solutions (j6.9p and (|6.15|) together in Fig. [5J The two curves 
are similar in both cases. If a = then r = which agrees with results 
which have previously been obtained for the preferential attachment model 
[27]. In this case the vertices which are close to the root are 'old' in the 
sense that once they reach the maximum degree (which they eventually do 
with probability one) they do not change again. Thus, a lot of vertices of 
high degree become neigbours. When a is increased above zero, a repulsions 
is introduced between these vertices, the value of r decreases and the trees 
show disassortative mixing. When a goes to 1, the trees approach the same 
non-random graph, a spine with no outgrowths. As a consequence, the value 
of r approaches the same value in both cases. 
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-0.20 




Figure 5. Comparison of Equations (|6.9p {D = 3, black) and 



dHSl) (I? = (X), 7 = 1, gray). 



7. Conclusions 



We introduced the ag-model, a special case of the vertex splitting 
model which has the Markov branching property. For particular choices of 
parameters it reduces to models of generic trees jl4j , preferential attachment 
p] and non-generic caterpillars |23j . It was proved that the finite volume 
measures generated by the growth rules converge to a measure which is 
concentrated on the set of trees with exactly one spine and the limiting 
measure was described explicitly. The same has been done before in Ford's 
a-model |28j and a special case of the a7-model [29]. Extension of these 
convergence results to the vertex splitting model is a work in progress. 

There is another notion of convergence of random trees, referred to as 
the scaling limit, see e.g. [3l[l9]. This means, roughly, that a random tree 
Tn viewed as a metric space with the graph metric dgr suitably scaled, con- 
verges weakly, in the Gromov-Hausdorff topology, to a continuum random 
tree. In a recent paper on Markov branching trees [18], Haas and Miermont 
proved that under certain natural conditions on the first split distributions 
the scaling limit of the trees is a self-similar fragmentation tree, in the 
Gromov-Hausdorff-Prokhorov topology. We expect that this theory applies 
to the model studied in this paper and it would be interesting to confirm 
that. Moreover, it is an interesting and challenging problem to generalize 
the results on the scaling limit to the vertex splitting model when Markov 
branching is absent. 

The annealed Hausdorff dimension, with respect to the infinite volume 
measure of the ag-model, was calculated for a certain range of the parame- 
ters. The results partly support scaling assumptions which were made when 
calculating the fractal dimension in the vertex splitting model [9]. In the 
special case of growing caterpillar graphs we calculated, almost surely, the 
Hausdorff and spectral dimension. It turns out that the dimensions are 
related by the formula 



2dH 




l + dn 
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This equation holds in general for tree models which satisfy a certain unifor- 
mity condition and under the assumption that vertex degrees are uniformly 
bounded from the above We expect this relation to hold in the ag~model 
and it would be desirable to check whether it holds in the vertex splitting 
model. 

It is possible to study other interesting observables in the ag-model 
such as the vertex degree distribution and correlations between degrees of 
neighbouring vertices in large trees. It would be interesting to give a rigorous 
proof of (16. 4p and even to get stronger convergence results for the random 
variables Xi^n- This can presumably be done, at least for some range of 
the parameters, using results on generalized Polya urns [20]. Furthermore, 
it would be interesting to confirm the validity of (|6.2p for as general set of 
parameters as possible. Similar results about the convergence of Xij^n are 
desirable. 

A natural question is whether the ag-model is the only special case of 
the vertex splitting model which has the Markov branching property. As was 
noted in the introduction, the a7-model has the Markov branching property 
but it is not strictly a special case of the vertex splitting model, rather a 
limiting case. Since the vertex splitting model has local and isotropic growth 
rules, one might also ask whether there exists some other notion of self- 
similarity which could be used to handle the general case. An understanding 
of this could be a key element towards a solution of the most general case. 

Acknowledgement. I am deeply indebted to Thordur Jonsson and Frangois 
David for helpful discussions and comments. 

Appendices 

Appendix A. Solution of the first split distributions 

In this section we describe a 'network flow method' for solving the 
recursion equations for the first split distribution given in Proposition 13.11 
We encountered this method in [T7j where it was used to solve recursion 
equations for the first split distribution of Ford's a-model. 

First of all, it is straightforward to derive (|3.5p in the case k = 2. 
Consider next the case A; = 3 in which case the nearest neighbour of the 
root has two disjoint subtrees of descendants which we refer to as the left 
and right subtree. We represent a state when the tree has rii edges in the 
left subtree and n2 edges in the right subtree by a node (ni,n2), ni,n2 > 1 
in the network in Fig. [6l We assign conductance 

C((ni - l,n2) ^ = (A.l) 

ni + n2 - a 

between states (ni — l,n2) and (^^l,?^-2), which is the probability of going 
from state (ni — l,n2) to state (ni,n2) (symmetric in ni and 712) given 
by the recursion (j3.3p . Let uj = (wq, wi, . . . , a;|^|) be a simple path (non- 
backtracking path) in the network with endpoints ujq and Wj^^i where |a;| 
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Figure 6. A network flow diagram with sources Xi, z > 1. 



denotes the length of the path. Note that the product of conductance along 
a path u from a state (z, 1) (or (l,"*)) to a state (ni,n2) is given by 

n''^' (i - a)(^ + 1 - o)r (m - g) r (na - a) 
. , ^^"^-^ ^ "^-^ = r(ni + n, + l-a)r(l-a) ^^-'^ 

j=i 

and is independent of the path chosen. We define the value of a state (ni , ri'i) 
as (/3(ni,n2) and we define as a source which flows into state (ni, 1) (or 
(l,ni)). We can write 

93(^1,712) = C((ni - l,n2) (ni,n2))g3(?^i - l,n2)(l - 

+ C((ni, 71,2 - 1) (^ll, 'ra2))g3(«l, "2 - 1)(1 - (^na.l) 

(A.3) 

i.e. g3(ni,n2) is given by the flow from the neighbouring sources and the 
neighbouring states, weighted by the conductance between the states. By 
comparing ()A.3P to (|3.3p we find that 

.. = ,3(l.l) = l-,.(2) = i^ (A.4) 

I — a 

and 

_ W2q2{ni) _ {1 - a){ani - a + 1 - a) 
"^"^ ~ 2Wini + 1) ~ 2(ni + l-a)(ni-a) ^ 

for ni > 2. 

From (IX3I) we conclude that (?3(ni,n2) is given by the sum over all 
paths from all sources which lead to the state (ni,n2), where each path is 
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weighted by the product of the conductance along the path, i.e. 

q3{ni,n2) = ^ C(u;j-i Wj) 

i=l paths ui from (i, 1) j=l 
to (ni, 712) 

«=2 paths from (1, i) j=l 
to (ni, 712) 

Since the conductance along a path between given states is indepen- 
dent of the path chosen, we can take the product outside the inner sum and 
we are simply left with a counting problem. The number of paths between 
(i, 1) and (ni, 712) is {"'^~^ni-i~^) number of paths between (1, i) and 

(ni,n2) is ■ We can now easily perform the sums over i and we 

recover ()3.5p for A: = 3. 

This argument can be generalized to higher values of k and it yields 
the formula (13. 5h. 



Appendix B. Proof of main theorems 

In this section we collect proofs of theorems and lemmas stated in the 
main part of the paper. We need the following two lemmas in the proof of 
Theorem 14.11 

Lemma B.l. For n > k, 

r(l-a)r(A:-2 + i^) ^ / _ aai \ {-ly+^T (n - 1 - ai) 

r(i^)r(n-a) " ai-a + lJTii)T{k-^)T{l-ai)^ 

{B.l) 

Proof. The proof follows from a generating function argument. Define 

nr(i-a)r(n. + i) (i-«-" + «Ei + ^ )• 

n.i>l 

(B.2) 

Then 

V- , , r(fc-2 + ^)r(i-a)r(n) ^ 

^ E ^'=("^'•••'"^-1)= r(i^)r(fe)ar(n-a) 

jiiH hr!.fe_i=ri-l \ a J y ^ v ^ 

(B.3) 
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By 

oo 

^A^,,C = C((l-a-a)(l-(l-cr)' 

i)(i-cr-'£(i-(i-c'r)'"'<) 



n=k 



/ aar(fc)r(l±^) 



2 = 



(^-1 + ^) 

ai — a + l/ V ^ 



(B.4) 

From the last expression we can determine the coefficients ^n,fc and inserting 
them into (|B.3|1 completes the proof. □ 

Lemma B.2. For a > 0, 

oo 



lim gfe(ni, . . . ,nfc_i) 

l^OO ^ ' ^ ' 

k=2 niH hnfe_i=n— 1 

CXD 

Elim gfc(ni, . . . ,nfc_i) = 1. (B.5) 



n— >-oo 

fc=2 niH hnfe-i=n— 1 



Proof. We obtain the following limit from (jB.ip 

lim gfc(ni,...,nfc_i) = "^1^2^^"^,^^ ^^"^^ 

™„,+...+tr:,=„_i r (i^) r (fc - 1 + i) 

since the second term in (jB.lh converges to zero as n — )• oo. The second 
equality in (IB.SP then follows in a straightforward way (e.g. by representing 
the sum of the right hand side of (1B.6P by a hypergeometric function, see 
for instance [T]). The first equality is then trivial since by definition 

oo 

XI X] qkini,...,nk-i) = 1. (B.7) 

fc=2 niH hnfe_i=n— 1 

□ 



Proof of Theorem \4-l\ Due to properties of the metric space (7~, d) (see 
[I1])[2I])) it is sufficient to show that the sequence 

{i^n{{T € T : To is a left subtree of (B.8) 

converges for any finite tree tq. We will write fn(To) as a shorthand for (IB.Sp . 
We proceed by induction on |ro|. Since every tree has the single rooted edge 
s as a left subtree, (IB.SP clearly converges when |ro| = 1. Assume that it 
converges for all finite trees r for which |r| < and denote the limits by 
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i^(r). Take a tree tq such that |ro| = + 1. Denote the degree of the nearest 
neighbour of the root in tq by k and decompose tq as tq = * n * • • • * Tk-i- 
To simphfy notation, we define Ti = s for i > k, such that fn(Tj) = 1 when 
i > k. From the Markov branching property we find that 

g=k ni-\ \-ni_i=n—l j=l 

(B.9) 

see Fig. [71 We need to take the n — >• c« hmit of the above equation. First 
note that 

e-i 

niH f-n^-i j=l niH hn^-i 

=n— 1 =n— 1 

and thus, by Lemma [B.2l and dominant convergence we can swap the n — t- oo 
hmit and the sum over i in (IB.Op . Therefore we will now only consider one 



"3 




® 



Figure 7. A diagram explaining (jB.Qp . In this case fc = 4. The 
dotted circles and lines indicate all the possible trees which have 
To = Ti*T2* T3 as a left subtree. For i < fc — 1, Ui denotes the total 
volume of the tree which contains as a subtree and for i > k, 
rii denotes the total volume of each of the other trees which are 
attached to the nearest neighbour of the root. 
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term in the sum over i 

e-i 

^ qi{ni,...,ng_i)Y\_i^nATi. 

niH hn^-i 1=1 

=n-l 



r(£-2+l^)r(l-a)r(n) ^ ar(n,-Q) 

r(i^)r(^)ar(n-a) ^ iir(i-a)r(ni + i)''"'^^*^ 

ni>l, Vi 



X 1 — a — a + a 



^-1 

rii 



(B.IO) 

There is always some index J such that nj > [n — !)/(£ — 1) in the above 
sum. First, consider the contribution from the constant term 1 — a — a inside 
the parentheses in (jB.lOp . call it Si(n). Using (13.7P we find the upper bound 

^ ^ m=e-2n2-i \-nt_i=m.j=2 ^ ' \ J I 

ni>l, Vi 

where C\ {€) > is independent of n and we renamed m = n—\ — nj . Using 
p.ip , (|3.6p and a generating function argument as in Lemma IB. II we find 
that the summand in the sum over m is 0(m~'^). Therefore Si(n) = 0(n~'^) 
which goes to zero as n — )• oo. 

Next, consider the contribution from terms in the sum inside the 
parentheses in (jB.lOp for which i ^ J, call it S2(n). It can be estimated 
from the above by 

^ m=l-2n2-\ hn^_i=m j=2 ^ ' \ 3 I 

ni>l, Vi 

(B.ll) 

where Ciii^ > is independent of n. As before, we find that the summand 
in the sum over m is 0{rri~°^^^^ and thus S2(n) = 0(n~") which goes to 
zero as n — 7- oo. 

Finally, consider the contribution to (|B.10p from the term in the sum 
inside the parentheses in (jB.lOp for which i = J and for which X^^y j '^i > ^ 
where A> (. — 2. Call this contribution S3(n, A). We find that 

' n — 1 \ " 1 TT ^-'^ ('^i ~ 



' ^ V^-ly ^,m + l ^ J-J-r 1-a r + l 

^ ^ m>A n2H |-n(>-i=mj=2 ^ ' \ / 

ni>l, Vi 

(B.12) 

where C^{1) > is independent of n. It is evident from (jl.ip and (j3.6p 
that the sum over m is convergent. Therefore, limsup„_^3o £3(71, ^) — >• as 

23 



^ — >■ oo. Thus, 

limsup ^ g£(ni, . . . ,n£_i) i/rij (Tj) = limsupE3(n,^) 

niH \-ni_i j=l 

=n-l 

aT{£-2 + ^) ^^ ^ 1 ^ ^fp^ aT {nj - a) 

+ Y(^)T(e) l^""^^^^ ^ m+1 ^ ii r(l-a)r(n, + l)''"^^^^'^ 

V a ; V^; j=i m=^-2 n2+-+ni_i=m j=l ^ V V J i ; 

ni>l, i^J, nj=0 jV-^ 

gr (£-2 + i^) ^ ^ ^_ ^ ar(n,-a) 

X;;;^ r(i^)rm Z^^^^^^ ^ m + i ^ r(i - a)r(n, + 1)"""^^^^'^' 

ni>l, i^J, nj=0 jjtJ 

(B.13) 

It is clear the the same holds for the lim inf and therefore (jB.Sp converges for 
tq. Furthermore, the last expression (jB.lSP along with the estimate ()B.12p 
on S3 characterizes the measure u. The distribution of the vertex degrees 
on the spine also follows from ()B.6p . □ 

Proof of Lemma \5.2l The proof involves deriving and solving a differential 
equation for certain generating functions. We start by defining a generating 
function for the quantity of interest 

00 

_R=1 i 

(B.15) 

and we define the probability generating function for the volume of a graph 
ball of radius R which is centered on a vertex on the spine and intersects 
the outgrowths from that vertex 

00 

fniz) = ^^^[{{n,T2,...)■.J2\^R{n)\=j])zK (B.16) 

j=0 i 

(B.17) 

It is furthermore convenient to define the following rescaled probability gen- 
erating function, with respect to f„, of the volume of a ball of radius R 

r (n — a) 



= r(l-„';r(n+l) E--({^HBH(r)|=j}).MB.18) 

^ ^ ^ ' ^ j=0 

(B.19) 

and the corresponding generating function over n 

00 

Gr{zX) = J]Aij,„(z)C. (B.20) 



n=l 
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(B.21) 



We furthermore define 

Vr{0 = d,GR{z,C)\z=i and (B.22) 

oo 

QAO = ^VdOx''. (B.23) 

R=l 

It is straiglitforward to show that 

1 _ (1 _ 

Gr{1,0 = (B.24) 



for all > 1. Using dO]), (g^j) and ((HJD we find the relation 

fR{z) =a aGR{z, C))"^ dC (B.25) 

Jo 

and therefore, it follows from ()B.24p that 

^ \BR{n)\)^ = f'^il) = a(l - a) £ Vr{C'){1 - CT-'^-'dC' (B.26) 

and ^ 

H{x) = a{l-a)l Q^iOil-CT'^'^dC'. (B.27) 

JO 

Thus, knowing Qx{0^ we can expand H{x) and find its coefficients, which 
completes the proof. We proceed as follows: Using the Markov branching 
property we can derive the following recursion 

fc=2 ^ ^ „i+...n,_i=n-l \ 0=1 I i=l 

ni>l, Vi 

(B.28) 

for i?, n > 2. In terms of the generating function Gr{zX)i (|B.28P can be 
written as 

0.(.,C)^ -('-"-^'°) (B.29) 

o 

for i? = 1 and 

d^GRiz,C) = z(^{l- aGR-iizX))^ +ad^GR^i{z,C) J^^ (l - aG^j_i(z, C')) < 

(B.30) 

for R > 2. By differentiating the above with respect to z, rearranging and 
differentiating again with respect to C we get the following recursion 

ViiO = ^ ^ (B.31) 

a 

and 

(1 - c)i4'(c) - (1 - a)vuc) = (1 - 0(1 - (1 - cr)vii_AO 

+ (1 - a)(2(l - 0" - l)FA-i(C) + a(l - a)(l - C)"-Vfi_i(C). 
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Writing this in terms of the function QxiC) we find the differential equation 
(1 - C) (l - ^ - (1 - 0") g^(C) + (1 - a) (^2(1 - 0" - 1 + ^) QUO 



+a(l-a)(l-Cr-^Qx-(C) = 
with initial conditions 

Q^(0) = and Q^O) 



X 



1 — X 



(B.32) 



We will solve this differential equation with straightforward methods and by 
plugging the solution into ()B.27p we can determine the function H{x) and 
extract the coefficients of its power series. We start by making a convenient 
change of variables by defining y{Q = (1 — C)°(l — x~^)^^ and 



Pcc{yiC)) = QAC)-^ + 



a 



a{x — 1) 



(B.33) 



Then Px{y) satisfies the following inhomogeneous, hypergeometric differen- 
tial equation 



ay{l - y)P'^{y) + {a-a-{a-2a + l)y)P'Ay) + 
(a — a + l)(a — a) a — a a{l — a)x 

o y ~t~ 



a(l — a) 



a 



Px{y) 



a a'^{x — 1) 



with initial conditions 



P. 



X 



X- 1 



P' 



X — 1 



0. 



(B.34) 



(B.35) 



A basis of solutions to the homogeneous equation is 



ui{y) = {-y)^ and U2iy) = il-yy 



i-g /I — a a — a y 



2F1 



a 



,1, 



a 



The Wronskian is given by 
W{y) = 



(B.36) 



ui{y) U2{y) 
u'liy) u'^iy) 



U, . a — a. 

-(-y) ° (1 - y) 



a 



The Green's function corresponding to (IB.34P is defined as 

1 



F{y,v) 



av{l — v)W{v 
The solution to (iBrM]) and ([BlSS]) is then 



U2{v)ui{y) + ui{v)u2{y) ■ 



Px{y) 



y ( {a — a + l){a — a) a — a a(l — a)x\ 
F{y,v) { ^ V h — I av. 



a 



a'^{x — 1) 



(B.37) 
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Using ([B:2711 and (|B:33]1 we find that 

a — a Q 

H{x) = (l-a)(^i^^ " j^P^{y)dy 



ail — a)x 
^ ' (B.38) 



(a — a){2a — a)(l — x) 
Plugging the solution ()B.37p into ()B.38P and using the formula 

f{s)dsdx= I f{s){t-s)ds (B.39) 



we obtain, with some rewriting, 

, , 2 o- — a , , £l f 1 — ft Q^~t~l 1 ^ , , _ 

H[x) = (1 — a)a x a (1 — x) " / 2F1 I , 1, ; v \ v « [l — v)~ 



l-x 



a a 



a x(l — x) a f + (1 ~ ft)(ft — a + axjf 



+ (1 + a — a){ax — 2ax — 2a + 2a)v + (a — a) (a — 1 — 2a) (1 — x) dv 
a{l — a)x 

^ {a - a){2a - a){l - xY ^ ' 

It is possible to simplify this even further. By considering solutions for a 
few choices of convenient parameters (e.g. D = 3 or D = 00, 7 = 1) one 
immediately guesses that 

H{x) = l-x-'U^i 1, , ;x -1 . B.41 

\ — a — a \ \ a a J J 

It is then straightforward to check that this is true by rearranging and 
differentiating ()B.40p . The result follows by expanding ()B.4ip . □ 

Proof of Theorem \5.SX Let \{R) be a positive function with the property 
that 

We will show that there exist constants Ci and C2 and for z^-almost all trees 
T a constant Rt > such that 

Ci(log(i2)-ii2)i/" < \Bn{T)\ < C2{\{R)RY''' (B.43) 

for all R > Rt- In particular we can choose A(i?) = (log(i?))'' for any rj > 1 
which is sufficient to obtain the desired result. 

On an infinite, rooted tree with a single spine, denote the vertex on 
the spine at distance n from the root by Sn, n > 1. Let (X„(r))„ be 
a sequence of random variables corresponding to the number of leaves at- 
tached to the vertices si,S2,... of T. Define Sfi{T) = '^^iXi{T). Then 
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\Bji(T)\ = S'/j_i(T) + R and it is clearly sufficient to prove the above in- 
equalities for Sr{T). Begin with the lower bound. Take k,9 > 0. Using 
Markov's inequality, the independence of the Xi 's and Equation ()4.8p we get 

u{Sr{T) <k) = V (e-^^« > e-^'^) < e'^ ((e-^^^^^)),)'' 

Now choose k = K{\og{R))-^/'^R^/"' and 6* = Note that 

= r (1 + 0(0)) (B.44) 

and therefore for R large enough 

v{Sr{T) < K{log{R))-^/^R^/^) < Cge"^"" = CsR'^"^ (B.45) 

where C3 is a positive constant. Choosing K = Ci small enough we see that 

00 

HSr{T) < Ci(log(i?))-V7i?i/-) < 00 (B.46) 

R=l 

and therefore, by the Borel-Cantelli lemma, there exists a constant Rt such 
that Sr{T) > Ci(log(ii))-V°/?i/f» almost surely for all R > Rt- 

The upper bound follows from Theorem 2 in [16j which states, for our 
purposes, that the probability of the event 

Sr{T) > aR, for infinitely many R (B.47) 

is zero if the sum 

00 

HXk > aR) (B.48) 

R=0 

converges, where qr is a positive sequence with the property that ur/R — )• 00 
as — >• cx). According to (14. 8|) 

HXk > aR) < C^a^'' (B.49) 

for a suitable constant C4. Choosing qr = (A(i?)ii)^/°, where X{R) has the 
properties stated above, completes the proof. □ 

Proof of Theorem \5.4\ For a tree T, define the generating function 

00 

QT{x) = J2PT{m-x)l (B.50) 

t=0 

From [14j Lemma 7 and Equation (6)] we have the following inequalities for 
any fixed caterpillar T and any integer R > 

^ ^ ,^ < Qt{x) <R+ (B.51) 

-^+x + x\Br{T)\ x\Br{T)\ 
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Using (|B.43p for a suitable choice of X{R) we get z/-almost surely the in- 
equality 

^ <QTix)<R + —-—^——rj- (B.52) 



T^ + x + x(A(i?)i?)i/" -^'^ ' - xCi{log{R)'^Ry/ 

for all R > -Rj- and Rt large enough. Choosing R = [x we find that 
there are numbers Ki(T) and K2(T) such that z/-almost surely 

Ki{T)\{[x~^])-'^x-^ < Qt{x) < K2{T) \og{[x~^])x~^. (B.53) 
This yields the desired limit. □ 
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